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ABSTRACT 


A computer system includes one or more microprocessors. 
The microprocessors assign a priority level to each memory 
operation as the memory operations are initiated. In one 
embodiment, the priority levels employed by the micropro- 
cessors include a fetch priority level and a prefetch priority 
level. The fetch priority level is higher priority than the 
prefetch priority level, and is assigned to memory operations 
which are the direct result of executing an instruction. The 
prefetch priority level is assigned to memory operations 
which are generated according to a prefetch algorithm 
implemented by the microprocessor. As memory operations 
are routed through the computer system to main memory 
and corresponding data transmitted, the elements involved 
in performing the memory operations are configured to 
interrupt the transfer of data for the lower priority memory 
operation in order to perform the data transfer for the higher 
priority memory operation. While one embodiment of the 
computer system employs at least a fetch priority and a 
prefetch priority, the concept of applying priority levels to 
various memory operations and interrupting data transfers of 
lower priority memory operations to higher priority memory 
operations may be extended to other types of memory 
operations, even if prefetching is not employed within a 
computer system. For example, speculative memory opera- 
tions may be prioritized lower than non-speculative memory 
operations throughout the computer system. 

35 Claims, 4 Drawing Sheets 
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COMPUTER SYSTEM INCLUDING 
PRIORITIES FOR MEMORY OPERATIONS 
AND ALLOWING A HIGHER PRIORITY 
MEMORY OPERATION TO INTERRUPT A 
LOWER PRIORITY MEMORY OPERATION 5 

This application is a continuation of U.S. patent appli- 
cation Sen No. 08/982,588, filed on Dec. 2, 1997, now U.S. 
Pat. 6,058,461. 

BACKGROUND OF THE INVENTION 10 

1. Field of the Invention 

This invention is related to the field of computer systems 
and, more particularly, to memory latency issues within 
computer systems. 

2. Description of the Related Art 

Superscalar microprocessors achieve high performance 
by executing multiple instructions per clock cycle and by 
choosing the shortest possible clock cycle consistent with 
the design. On the other hand, superpipelined microproces- 2Q 
sor designs divide instruction execution into a large number 
of subtasks which can be performed quickly, and assign 
pipeline stages to each subtask. By overlapping the execu- 
tion of many instructions within the pipeline, superpipelined 
microprocessors attempt to achieve high performance. 25 

Superscalar microprocessors demand low memory 
latency due to the number of instructions attempting con- 
current execution and due to the increasing clock frequency 
(i.e. shortening clock cycle) employed by the superscalar 
microprocessors. Many of the instructions include memory 30 
operations to fetch (read) and update (write) memory oper- 
ands. The memory operands must be fetched from or con- 
veyed to memory, and each instruction must originally be 
fetched from memory as well. Similarly, superpipelined 
microprocessors demand low memory latency because of 35 
the high clock frequency employed by these microproces- 
sors and the attempt to begin execution of a new instruction 
each clock cycle. It is noted that a given microprocessor 
design may employ both superscalar and superpipelined 
techniques in an attempt to achieve the highest possible 40 
performance characteristics. 

Microprocessors are often configured into computer sys- 
tems which have a relatively large, relatively slow main 
memory. Typically, multiple dynamic random access 
memory (DRAM) modules comprise the main memory 45 
system. The large main memory provides storage for a large 
number of instructions and/or a large amount of data for use 
by the microprocessor, providing faster access to the instruc- 
tions and/or data than may be achieved from a disk storage, 
for example. However, the access times of modern DRAMs 50 
are significantly longer than the clock cycle length of 
modern microprocessors. The memory access time for each 
set of bytes being transferred to the microprocessor is 
therefore long. Accordingly, the main memory system is not 
a low latency system. Microprocessor performance may 55 
suffer due to high memory latency. 

In order to allow low latency memory access (thereby 
increasing the instruction execution efficiency and ulti- 
mately microprocessor performance), computer systems 
typically employ one or more caches to store the most 60 
recently accessed data and instructions. Additionally, the 
microprocessor may employ caches internally. A relatively 
small number of clock cycles may be required to access data 
stored in a cache, as opposed to a relatively larger number 
of clock cycles required to access the main memory. 65 

Low memory latency may be achieved in a computer 
system if the cache hit rates of the caches employed therein 
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are high. An access is a hit in a cache if the requested data 
is present within the cache when the access is attempted. On 
the other hand, an access is a miss in a cache if the requested 
data is absent from the cache when the access is attempted. 
Cache hits are provided to the microprocessor in a small 
number of clock cycles, allowing subsequent accesses to 
occur more quickly as well and thereby decreasing the 
effective memory latency. Cache misses require the access to 
receive data from the main memory, thereby increasing the 
effective memory latency. 

In order to increase cache hit rates, computer systems may 
employ prefetching to "guess" which data will be requested 
by the microprocessor in the future. The term prefetch, as 
used herein, refers to transferring data (e.g. a cache line) into 
a cache prior to a request for the data being received by the 
cache in direct response to executing an instruction (either 
speculatively or non-speculatively). A request is in direct 
response to executing the instruction if the definition of the 
instruction according to the instruction set architecture 
employed by the microprocessor includes the request for the 
data. A "cache line" is a contiguous block of data which is 
the smallest unit for which a cache allocates and deallocates 
storage. If the prefetched data is later accessed by the 
microprocessor, then the cache hit rate may be increased due 
to transferring the prefetched data into the cache before the 
data is requested. 

Unfortunately, prefetching can consume memory band- 
width at an inopportune time with respect to the occurrence 
of non-speculative memory operations. For example, a 
prefetch memory operation may be initiated just slightly 
prior to the initiation of a non-prefetch memory operation. 
As the prefetch memory operation is occupying the memory 
system already, the latency of the non-prefetch memory 
operation is increased by the amount of time the memory 
system is occupied with the prefetch request. Particularly if 
the prefetch is incorrect (i.e. the prefetched data is not used 
later by the requester), the increased latency may decrease 
performance of the microprocessor (and the overall com- 
puter system). 

SUMMARY OF THE INVENTION 

The problems outlined above are in large part solved by 
a computer system in accordance with the present invention. 
The computer system includes one or more microprocessors. 
The microprocessors assign a priority level to each memory 
operation as the memory operations are initiated. In one 
embodiment, the priority levels employed by the micropro- 
cessors include a fetch priority level and a prefetch priority 
level. The fetch priority level is higher priority than the 
prefetch priority level, and is assigned to memory operations 
which are the direct result of executing an instruction. The 
prefetch priority level is assigned to memory operations 
which are generated according to a prefetch algorithm 
implemented by the microprocessor. As memory operations 
are routed through the computer system to main memory 
and corresponding data transmitted, the elements involved 
in performing the memory operations are configured to 
interrupt the transfer of data for the lower priority memory 
operation in order to perform the data transfer for the higher 
priority memory operation. 

Advantageously, even though memory bandwidth is con- 
sumed by the prefetch memory operations, the latency 
experienced by the fetch memory operations may not be 
significantly impacted due to the interrupting of the prefetch 
memory operations to perform the fetch memory operations. 
Performance of the computer system may be increased due 
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to the lack of impact on the latency of the fetch memory BRIEF DESCRIPTION OF THE DRAWINGS 

operations by the prefetch ^^^^Z^^h Other objects and advantages of the invention will 

more aggressive prefetch algorithms (e.g. algorithms which , J ,. & . c „ . . t . 
generafe* more prefetch memory operations) may be become apparent upon reading the following detailed 
employed because the concerns regarding increasing the 5 description and upon reference to the accompanying draw- 
memory latency of non-prefetch memory operations i n g s m which: 

because of interference by the prefetch memory operations FIG. 1 is a block diagram of one embodiment of a 

is substantially allayed. The more aggressive prefetch algo- computer system. 

rithms may lead to increased prefetch effectiveness, further pIG. 2 is a block diagram of one embodiment of a bus 

decreasing overall effective memory latency. Performance 1Q brid shown in FIG x 

of the microprocessors employing the more aggressive , , _ , ' .' , . c ... 

prefetch algorithms may thereby by increased, and overall FIG - \ \ a ^chm illustrating operation of one embodi- 

performance of the computer system may accordingly be ™ aX of the bus bridge shown in FIGS. 1 and 2 upon 

improved. receiving a memory operation. 

While one embodiment of the computer system employs FIG. 4 is a timing diagram illustrating operation of certain 

at least a fetch priority and a prefetch priority, the concept of 15 signals upon an interface between the bus bridge shown in 

applying priority levels to various memory operations and FIGS. 1 and 2 and the main memory shown in FIG. 1, 

interrupting data transfers of lower priority memory opera- according to one embodiment of the bus bridge and the main 

lions to higher priority memory operations may be extended memory, for a memory operation. 

to other types of memory operations, even if prefetching is nG 5 ig a timi dia U i ustratin g operation of certain 

not employed within the computer system. For example, 20 ^ ^ between the ^ b shown m 

speculative memory operations may be prioritized lower * ^ 2 ^ ^ ^ m shown - n nG x 

than non-speculative .memory ^^J^^J^ according to one embodiment of the bus bridge and the main 

computer system. Performance of the computer system may & - _ a .7 t , , 

thereby be increased. memor y' for a first memor y °P eratl0n interrupted by a 

Broadly speaking, the present invention contemplates a 25 memory °P eratlon - 

method for transferring data in a computer system. A first While the invention is susceptible to various modifica- 

mcmory operation having a first priority is initiated. lions and alternative forms, specific embodiments thereof 

Subsequently, a second memory operation having a second are shown by way of example in the drawings and will 

priority is initialed. At least a portion of data corresponding herein be described in detail. It should be understood, 

to the first memory operation is transferred. The transferring 30 however, that the drawings and detailed description thereto 

is interrupted if the second priority is higher than the first are not intended to limit the invention to the particular form 

priority, and data corresponding to the second memory disclosed, but on the contrary, the intention is to cover all 

operation is transferred during the interruption. modifications, equivalents and alternatives falling within the 

The present invention further contemplates a computer spirit and scope of the present invention as defined by the 

system comprising at least one microprocessor, a main 35 appended claims, 
memory, and a bus bridge. The microprocessor is configured 

to initiate a first memory operation and to subsequently DETAILED DESCRIPTION OF THE 

initiate a second memory operation. Additionally, the micro- INVENTION 

processor is configured to assign a first priority to the first of onc embodimen , 

memory operation responsive to a first type of the first _ 6 _ , & 

memory operation, and to assign a second priority to the «° of a computer system 5 including one or more micropro- 

second memory operation responsive to a second type of the cessors ( e S- microprocessors 10A and 10B shown in FIG. 5) 

second memory operation. The main memory is configured C0U P led t0 a variet y of svstem components through a bus 

to store data including first data corresponding to the first bndge 12 is shown. Other embodiments are possible and 

memory operation and second data corresponding to the contemplated. In the depicted system, a mam memory 14 is 

second memory operation. Coupled between the micropro- 45 coupled to bus bridge 12 through a memory bus 16, and a 

cessor and the main memory, the bus bridge is configured to graphics controller 18 is coupled to bus bridge 12 through an 

initiate transfer of the first data from the main memory to the AGP bus 20. Finally, a plurality of PCI devices 22A-22B are 

microprocessor responsive to the first memory operation. coupled to bus bridge 12 through a PCI bus 24. A secondary 

Furthermore, the bus bridge is configured to interrupt trans- bus bridge 26 may further be provided to accommodate an 

fer of the first data upon receiving the second memory 50 electrical interface to one or more EISA or ISA devices 28 

operation if the second priority is higher than the first through an EISA/ISA bus 30. Microprocessors 10A and 10B 

priority. are coupled to bus bridge 12 through a CPU bus 34 and a 

Moreover, the present invention contemplates a bus priority line 38. Alternatively, independent buses may be 

bridge for a computer system, comprising a CPU interface coupled between bus bridge 12 and each of microprocessors 

block and a memory controller. The CPU interface block is 55 10A and 10B. As illustrated by the dotted illustration of 

coupled to receive bus operations from a microprocessor. microprocessor 10B, embodiments of computer system 5 

The bus operations include memory operations, and each employing only one microprocessor are contemplated, 

memory operation includes a priority assigned by an initia- Additionally, embodiments employing more than two 

tor of the memory operation. Coupled to the CPU interface microprocessors are contemplated. 

block and a memory, the memory controller is configured to 60 Generally speaking, microprocessors 10A and 10B are 

receive each memory operation and the priority from the configured to initiate memory operations upon CPU bus 34 

CPU interface block. The memory controller is configured to in order to transfer data to and from main memory 14. 

interrupt an in-progress memory operation to service a Microprocessors 10A and 10B assign a priority to each 

subsequent memory operation if a first priority correspond- memory operation, and transmit that priority concurrently 

ing to the in-progress memory operation is lower than a 65 with initiation of the memory operation. The assigned pri- 

second priority corresponding to the subsequent memory ority is transmitted via priority line 38. In one embodiment, 

operation. at least two priority levels are defined: a fetch priority level 
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and a prefetch priority level. The fetch priority level is 
assigned to memory operations which are the direct result of 
executing an instruction. These memory operations may be 
either read memory operations or write memory operations. 
The prefetch priority level is assigned to prefetch memory 5 
operations generated in accordance with the prefetch algo- 
rithm employed by microprocessors 10 A and 10B. Prefetch 
memory operations may be read memory operations. It is 
noted that microprocessors 10A and 10B may employ any 
suitable prefetch algorithm. A variety of well-known 
prefetch algorithms may be used, for example. 

Bus bridge 12 receives the memory operations initiated by 
microprocessors 10A and 10B, and transfers the data to/from 
main memory 14 via memory bus 16. The data is returned 
to the microprocessor 10A or 10B (if the data is a read 35 
memory operations) via CPU bus 34. Data is transferred to 
bus bridge 12 via CPU bus 34 for a write memory operation, 
and the data is subsequently transmitted to main memory 14 
via memory bus 16. 

If bus bridge 12 is in the process of performing a data 2 o 
transfer for a memory operation to main memory 14 and the 
memory operation is assigned a prefetch priority level, the 
read memory operation may be interrupted to perform a data 
transfer for another memory operation which is assigned a 
fetch priority level. Advantageously, prefetch memory 2 s 
operations (which are assigned the prefetch priority level) do 
not interfere with access to memory for fetch memory 
operations (i.e. memory operations assigned the fetch pri- 
ority level). The fetch memory operations may be completed 
with a latency similar to the latency that is experienced in the 30 
absence of prefetch memory operations. Even though the 
prefetch memory operations consume memory system 
bandwidth, they may not substantially increase the memory 
latency of the fetch memory operations. Performance of the 
microprocessors 10A and 10B (and overall performance of 35 
the computer system) may be increased by the lack of 
increase in the memory latency for fetch operations due to 
the occurrence of prefetch memory operations. Furthermore, 
the interruption of prefetch memory operations to perform 
higher priority fetch memory operations may allow for more 40 
aggressive prefetch algorithms to be employed within 
microprocessors 10A and 10B. Since the latency of fetch 
memory operations is substantially unaffected by the 
prefetch memory operations, more aggressive prefetching 
may be permissible. 45 

Subsequent to transferring data in response to the fetch 
memory operation, bus bridge 12 is configured to resume 
transferring data for the interrupted, lower priority memory 
operation. The lower priority memory operation is thereby 
completed. It is noted that the interruption of the transfer of 50 
data may occur upon memory bus 16 or upon CPU bus 34, 
depending upon the embodiment. For example, if CPU bus 
34 employs tagging to identify the address transfer of a 
memory operation with the corresponding data transfer, the 
tag may be conveyed with each data transfer on CPU bus 34. 55 
To interrupt a lower priority data transfer to perform a higher 
priority data transfer, the tag of the higher priority data 
transfer is conveyed. Subsequently, the tag of the lower 
priority data transfer is conveyed to complete the data 
transfer of the lower priority memory operation. 60 

In one embodiment, bus bridge 12 is configured to inter- 
rupt a data transfer to main memory 14 if the lower priority, 
in-progress memory operation and the higher priority 
memory operation are within the same "page". As used 
herein, a "page" refers to a block of data stored within the 65 
same row of the DRAMs which comprise main memory 14. 
The row is accessed via a row address provided by bus 
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bridge 12, and then the column address of the particular 
datum being addressed is provided (typically using the same 
address lines used to provide the row address). Additionally 
data within the row can be accessed by providing another 
column address without providing the row address again 
(referred to as a "page hit"). Reading or writing additional 
data from the same row in this manner (referred to as "page 
mode") may allow for lower latency access to the data, since 
the row address need not be provided in between each 
column access. 

By interrupting a lower priority memory operation to 
perform a higher priority memory operation in the same 
page only, the higher priority memory operation may be 
performed quickly (e.g. with a page hit timing). If a different 
page were accessed, then the current page would be deac- 
tivated and the new page accessed by providing the row 
address of the higher priority memory operation, then the 
corresponding column addresses. Subsequently, the new 
page would be deactivated and the page corresponding to the 
lower priority memory operation re-established. The time 
spent deactivating and activating pages may outweigh the 
latency savings for the higher priority memory operation. 

While the present disclosure may refer to the prefetch 
priority level and the fetch priority level for memory opera- 
tions (with the fetch priority level being a higher priority 
than the prefetch priority level), it is contemplated that other 
priority levels may be assigned for other purposes in other 
embodiments. Furthermore, even if prefetching is not 
employed, the assignment of priority levels to different types 
of memory operations may be advantageous. For example, 
speculative memory operations (performed due to the execu- 
tion of speculative instructions) might be assigned a lower 
priority level than non-speculative memory operations 
(performed due to the execution of non-speculative 
instructions). In this manner, speculative memory operations 
could be interrupted to perform non-speculative memory 
operations. Since the non-speculative memory operations 
have been confirmed as being required according to the 
execution of the program and the speculative memory 
operations may or may not be required, it may be advanta- 
geous to interrupt the speculative memory operations to 
decrease the latency of the non-speculative memory opera- 
tions. As another example, write back operations to update 
memory with updates made to a cache line within the cache 
of a microprocessor may be assigned a lower priority than 
memory operations to fill a cache line within the cache. 

As used herein, a "memory operation" is a transfer of data 
between an initiator and a memory (or a master and a slave, 
respectively). A "read memory operation" is a transfer of 
data from the slave (i.e. the memory) to the master. For 
example, microprocessor 10A or 10B may initiate a read 
memory operation to transfer data from main memory 14 to 
the microprocessor. A "write memory operation" is a transfer 
of data from the master to the slave (i.e. the memory). For 
example, microprocessor 10A or 10B may initiate a write 
memory operation to transfer data from the microprocessor 
to main memory 14. Memory operations may be of different 
sizes. However, memory operations to transfer data to and 
from the cache (e.g. prefetch memory operations and many 
fetch memory operations) may be performed using a cache 
line size. Generally, several transfer cycles (or "beats") on 
both memory bus 16 and CPU bus 34 are used to transfer a 
cache line of data. For example, four beats is a typical 
number to transfer a cache line. Interrupting a memory 
operation to perform a higher priority memory operation 
may comprise inserting the beats for the higher priority 
memory operation between two of the beats for the lower 


06/09/2004, EAST version: 1.4.1 


US 6,298,424 Bl 

7 8 

priority memory operation. To "initiate" a memory to generate PCI protocol transactions on AGP bus 20. The 
operation, at least the address of the memory operation is AGP interface of bus bridge 12 may thus include function- 
conveyed to the slave. Additional control information ality to support both AGP protocol transactions as well as 
(including, e.g. the priority level and the read/write nature of PCI protocol target and initiator transactions. Display 36 is 
the memory operation) may be conveyed concurrent with 5 any electronic display upon which an image or text can be 
the memory operation or using a predefined protocol with Pf^d. A suitable display 36 includes a i cathode ray tube 
respect to conveyance of the address. More than one bus C^RT"), a liquid crystal display ( LCD ), etc^ 
clock cycle may comprise initiating a memory operation, 11 * ™ted thal ' ^ hde the ?C *> " d IS , A or *! SA 
depending upon the protocol of CPU bus 34. Data may be buses have been used as examples in the above description, 
conveyed at a time subsequent to initiation of the memory 10 an V bus architectures may be substituted as desired, 
operation Turning now to FIG. 2, a block diagram of one embodi- 

t jj-.' * iL i j Mir- Vt . ment of bus bridee 12 and main memory 14 is shown in 

In addition to the above described functionality, bus » J . , 

. . . . j ' 4 c i-i • greater detail. Other embodiments are possible and contem- 

bndge 12 generally provides an interface between micro- & ... ^. . . . 

i,u j inr> u~ «~ plated. Only portions of bus bridge 12 pertaining to the 

processors 10A and 10B, main memory 14, graphics con- ^ . j- i u • ^ . ° „ 

"roller 18, and devices attached to PCI bus 24. When an « presen disclosure are shown in FIG. 2. Other portwns may 

operation is received from one of the devices connected to * tmplemented as destred. As shown m FIG. 2, bus bndge 

« t • j , . ■ , ^ . . .. fl „ t , . rtf a 12 includes a CPU interface block 40 and a mam memory 

bus bridge 12, bus bridge 12 identifies the target of the . „«k,j d „ 

. & , ■ i j * • *u fnpiw controller 42. Main memory controller 42 may include a 

operation (e.g. a particular device or, in the case of PCI bus / ' 

^ .u . t u\ ♦ • nm u ->^\ i-> request queue 44, a control unit 46, an open page/pnority 

24, that the target is on PCI bus 24). Bus bridge 12 routes the 4 7 • „ A ' . *\. ^^iL^in-i 

' . t r t * jj • t» u j 11 11 ?n storage 48, and a data buffer 50. CPU interface block 40 is 

operation to the targeted device. Bus bndge 12 generally 20 f, '™ TU j • •> r i fi aaa-^^u, 

4 1 * 4- r .u 1 1^ k„*JL coupled to CPU bus 34 and priority line 38. Additionally, 
translates an operation from the protocol used by the source *: . r / . ' 

1 . . , u * 1 Ik #1,^ f« a Z„ v~ ~r u„o CPU interface block 40 is coupled to main memory con- 

dev.ee or bus to the protocol used by he target devic or bus address/tag bus 52, a R/W line 54, a priority 

and routes the opera ton appropriately. Bus bndge 12 may * address/tag bus 52, 

further be responsible for coherency activity to ensure a R/W ^ e54 iofi | ne56 anddala/tagbus58 are 8 coupled 

proper result for the operation, etc. 25 ' t aa i i * * u eo • 1 i ( ~fi„t., 

K ^ r to request queue 44, and data/tag bus 58 is coupled to data 

In addition to providing an interface to an ISA/EISA bus buffer 50 Requesl queue 44y data bu n; er 50, and open 

from PCI bus 24, secondary bus bndge 26 may further p a g e /priority storage 48 are coupled to control unit 46. 

incorporate additional functionality, as desired. For Add i t i 0 nally, control unit 46 is coupled to an address and 

example, in one embodiment, secondary bus bridge 26 ^ contro l bus 16A and a data bus 16B which comprise memory 

includes a master PCI arbiter (not shown) for arbitrating bus 16 Dm buffef 50 is alsQ coupled t0 data bus 16B , Main 

ownership of PCI bus 24. An input/output controller (not mem0 ry 14 comprises a plurality of DRAM banks 

shown), cither external from or integrated with secondary 60 A-60N, Each DRAM bank 60A-60N comprises one or 

bus bridge 26, may also be included within computer system more DRA Ms, and each DRAM bank 60A-60N is coupled 

5 to provide operational support for a keyboard and mouse tQ memory bus i6 ^ DRA Ms included in main memory 
32 and for various serial and parallel ports, as desired. An 14 may comprise any type of DRAM, including standard 
external cache unit (not shown) may further be coupled to asynchronous DRAM, SDRAM, etc, 

CPU bus 34 between microprocessors 10A arid 10B and bus cpu block 40 fe confi ^ t0 reccivc bus 

bndge 12 in other embodiments. Alternatively, the externa ations from microprocessors 10A and 10B upon CPU 

cache may be coupled to bus bndge 12 and cache control ^ ^ {Q bu& tions cpu bus 34 in 

logic for the external cache may be integrated into bus tQ Qperalions reccived from Qther devices attached 

^ e thereto (e.g. coherency operations in response to memory 

Main memory 14 is a memory in which application accessed performed by other devices, etc.). If CPU interface 

programs are stored and from which microprocessors 10A block 40 receives a memory operation upon CPU bus 34, 

and 10B primarily execute. A suitable main memory 14 45 cpu i nte rface block 40 routes the address of the memory 

comprises DRAM(Dynamic Random Access Memory), operation and the corresponding tag from CPU bus 34 upon 

SDRAM (Synchronous DRAM), or RDRAM (RAMBUS address/tag bus 52 to main memory controller 42. 

DRAM). Additionally, the read/write nature of the memory operation 

PCI devices 22A-22B are illustrative of a variety of is conveyed via read/write line 54 and the corresponding 

peripheral devices such as, for example, network interface 50 priority (received upon priority line 38) is conveyed upon 

cards, video accelerators, audio cards, hard or floppy disk priority line 56. If the memory operation is a write memory 

drives or drive controllers, SCSI (Small Computer Systems operation, the corresponding data is conveyed via data/tag 

Interface) adapters and telephony cards. Similarly, ISA bus 58. 

device 28 is illustrative of various types of peripheral Request queue 44 stores the information provided by CPU 

devices, such as a modem, a sound card, and a variety of data 5S interface block 40. If request queue 44 is empty prior to 

acquisition cards such as GPIB or field bus interface cards. receipt of a memory operation and main memory 14 is idle, 

Graphics controller 18 is provided to control the rendering the memory operation may be selected by control unit 46 for 

of text and images on a display 36. Graphics controller 18 presentation to main memory 14. Additionally, if a memory 

may embody a typical graphics accelerator generally known operation is in-progress to main memory 14, the memory 

in the art to render three-dimensional data structures which 60 operation being received is to the same page as the 

can be effectively shifted into and from main memory 14. in-progress memory operation, and the priority level 

Graphics controller 18 may therefore be a master of AGP assigned to the memory operation is higher than the priority 

bus 20 in that it can request and receive access to a target level of the in-progress memory operation, control unit 46 is 

interface within bus bridge 12 to thereby obtain access to configured to interrupt the in-progress memory operation to 

main memory 14, A dedicated graphics bus accommodates 65 perform the memory operation. After completing the 

rapid retrieval of data from main memory 14. For certain memory operation, control unit 46 may resume the 

operations, graphics controller 18 may further be configured in-progress memory operation. 
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Generally, control unit 46 may select memory operations 
from request queue 44 in order. However, a higher priority 
memory operation (as indicated via the priority assigned to 
the memory operation by the master for the memory 
operation) may be selected over a prior, lower priority 5 
memory operation (if the lower priority memory operation is 
to a different address than the higher priority memory 
operation). Control unit 46 conveys the address of the 
selected memory operation to main memory 14 along with 
corresponding control information via address and control 
bus 16A In the present embodiment, the control information 
includes a write enable line to indicate that the operation is 
a read or write, a row address strobe (RAS_) line to indicate 
that the row portion of the address is being conveyed, and a 
column address strobe (CAS_) line to indicate that the 15 
column address is being conveyed. Other control lines may 
be included as well (for example, bank select lines for 
selecting which bank or banks is to respond to an access, 
etc.). The row portion of the address is conveyed first, along 
with an activation of the RAS_Jine. Subsequently, the 2Q 
column portion of the address is conveyed along with an 
activation of the CAS_line. If the operation is a read, the 
selected data is provided by main memory 14 upon data bus 
16B. Data buffer 50 may capture the data from the data bus 
under the direction of control unit 46. If the operation is a 25 
write, the write data is driven by control unit 46 upon data 
bus 16B. 

Typically, multiple beats are performed to complete a 
memory operation (e. g. to read or write a cache line of data). 
The page mode feature of the DRAMs forming DRAM 30 
banks 60A-60N is typically used to transfer the data. In 
other words, subsequent beats are performed by presenting 
the corresponding column addresses (utilizing the same row) 
and transferring the data. Certain types of SDRAMs may be 
configured to burst data from a particular set of columns in 35 
response to a particular column address over a number of 
cycles, such that the beats are provided in response to a 
single row and column address. Such SDRAMs may support 
an interruption command to allow the bursts to be 
interrupted, and the command may be used by control unit 40 
46 when such interruption is desired to perform a higher 
priority memory operation. 

Control unit 46 records the page (e.g. the row portion of 
the address) and the priority level of the in-progress memory 
operation in open page/priority storage 48. If a higher 45 
priority memory operation is detected (by comparing the 
priority level stored in open page/priority storage 48 to the 
priority level of incoming memory operations), and if the 
incoming memory operation is to the same page as the 
in-progress memory operation (detected by comparing the 50 
page recorded in open page/priority storage 48), then the 
in-progress data transfer may be interrupted to perform the 
higher priority memory operation. The column portion of the 
address of the higher priority memory operation is 
presented, and the corresponding data transferred. 55 
Subsequently, the lower priority (formerly in-progress) 
memory operation is completed by presenting the column 
portion of the address corresponding to the next data to be 
transferred, and by transferring the remaining data. 

In one embodiment, DRAM banks 60A-60N may be go 
accessed independently to allow multiple memory opera- 
tions to be performed between main memory 14 and main 
memory controller 42. For such an embodiment, multiple 
open pages and corresponding priority levels may be stored 
in open page/priority storage 48. 65 

Data buffer 50 may assemble the data from main memory 
14 for read memory operations. Once the data has been 
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assembled, the data is transferred to CPU interface 40 along 
with the corresponding tag presented when the read memory 
operation was transferred to main memory controller 42. 
CPU interface block 40 returns the data upon CPU bus 34, 
using the tag to identify the data thereon. 

It is noted that request queue 44 may be implemented as 
separate read and write queues, as desired. Alternatively, 
data buffer 50 may buffer both write data for subsequent 
conveyance upon data bus 16B and read data for subsequent 
conveyance on CPU bus 34. It is noted that data buffer 50 
and/or request queue 44 may be eliminated in various 
embodiments in favor of queuing in CPU interface block 40. 
It is still further noted that, if a lower priority write operation 
is interruptible in a given embodiment, the lower priority 
operation may not be interrupted if the higher priority 
memory operation overlaps with the lower priority memory 
operation. 

Turning next to FIG. 3, a flowchart is shown illustrating 
certain operations of one embodiment of bus bridge 12 upon 
receiving a memory operation. Other embodiments are pos- 
sible and contemplated. As illustrated at step 70, the memory 
operation is received. The memory operation is termed "new 
memory operation" to distinguish it from the in-progress 
memory operation within the flowchart of FIG. 3. 

Bus bridge 12 determines if the new memory operation is 
higher priority than the in-progress memory operation 
(decision block 72). If the new memory operation is the 
same or lower priority, then the in-progress memory opera- 
tion continues (step 74). The new memory operation is 
queued for subsequent presentation to main memory 14, 

On the other hand, if the new memory operation is higher 
priority than the in-progress memory operation, bus bridge 
12 determines if the new memory operation is in the same 
page as the in-progress memory operation (decision block 
76). If the new memory operation is in a different page, the 
in-progress memory operation continues (step 74). The new 
memory operation is queued for subsequent presentation to 
main memory 14. 

Conversely, if the new memory operation is to the same 
page as the in-progress memory operation, the in-progress 
memory operation is interrupted (step 78). The new memory 
operation is then presented to main memory 14 and com- 
pleted (step 80), and the in-progress memory operation is 
resumed at the point of interruption (step 82). 

It is noted that, in another embodiment, another factor 
which may be considered in determining if the in-progress 
memory operation is interrupted is the type of memory 
operation (read/write). If the new memory operation is a 
different type than the in-progress operation, then it may be 
desirable not to interrupt the in-progress operation. A certain 
amount of time to disable and enable the driving of data bus 
16B by main memory 14 and bus bridge 12 may be needed 
in the case of different types. It is further noted that, although 
the steps in the flowchart of FIG. 3 are shown serially in a 
particular order for ease of understanding, any order which 
achieves a similar result may be used. Furthermore, many of 
the steps may be performed in parallel in the combinatorial 
logic forming bus bridge 12. 

Turning next to FIG. 4, a timing diagram illustrating 
operation of certain signals included in address and control 
bus 16A and data bus 16B in one embodiment of computer 
system 5 for a read memory operation is shown. Other 
embodiments are possible and contemplated. The embodi- 
ment shown in FIG. 4 corresponds to standard asynchronous 
DRAMs. 

A RAS_signal and a CAS_signal are illustrated in FIG. 
4. To begin a memory operation upon memory bus 16, the 
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RAS_signal is activated (driven to a logical low value). 
Upon activation of the RAS__signal, the DRAM(s) capture 
the row address (illustrated as RA1 on the address lines 
ADDR in FIG. 4). In FIG. 4, the correlation between 
activation of the RAS_signal and the capture of the row 5 
address is illustrated by arrow 90. 

Subsequently, the column address for the first beat of data 
is provided. The DRAM(s) capture the column address for 
the first beat of data (illustrated as CA10 on the address lines 
ADDR in FIG. 4) upon activation of the CAS_signal. The 10 
correlation between activation of the CAS_signal and the 
capture of the column address is illustrated via arrow 92. 
Similarly, subsequent beats are accessed by providing addi- 
tional column addresses CA11, CA12, and CA13 upon 
subsequent activations of the CAS_signal (the correlation 35 
of which is shown by arrows 94, 96, and 98). 

Data corresponding to each column address is returned 
(the correlation to column addresses is indicated by arrows 
100, 102, 104, and 106. Hence data DA10, DA11, DA12, 
and DA13 is returned corresponding to column addresses 20 
CA10, CA 11, CA12, and CA13, respectively. After data 
DA13 corresponding to column address CA13 is received, 
the memory operation is complete in the present example. 

FIG. 5 is a timing diagram illustrating operation of certain 25 
signals included in address and control bus 16A and data bus 
16B in one embodiment of computer system 5 for a first read 
memory operation interrupted by a second, higher priority 
read memory operation. Other embodiments are possible 
and contemplated. The embodiment shown in FIG. 5 cor- 3Q 
responds to standard asynchronous DRAMs. 

Similar to FIG. 4, a row address RA1 corresponding to the 
first read memory operation is provided upon activation of 
the RAS_signal (the correlation to which is indicated by 
arrow 90). Subsequently, column addresses CA10 and CA11 35 
are presented on consecutive assertions of the CAS_signal 
(the correlation of which is indicated by arrows 92 and 94). 
Corresponding data DA10 and DA11 is returned corre- 
sponding to CA10 and CA11, as indicated by arrows 100 
and 102. 40 

However, the second memory operation is received and is 
determined to be to the same page as the first memory 
operation and is determined to be higher priority than the 
first memory operation. Accordingly, the first memory 
operation is interrupted. The next column address to be 45 
presented (upon the next assertion of the CAS_signal, the 
correlation of which is shown by arrow 110) after CA11 is 
CA20, the first column address for the second memory 
operation. At subsequent assertions of the CAS_signal, the 
remaining column addresses for the second memory opera- 50 
tion are provided (CA21, CA22, and CA23 indicated by 
arrows 112, 114, and 116). Since the first and second 
memory operation are within the same page, it is sufficient 
to provide the column addresses for the second memory 
operation to retrieve data for the second memory operation. 55 

Accordingly, data corresponding to column addresses 
CA20, CA21, CA22, and CA23 is provided (DA20, DA21, 
DA22, and DA23, respectively, the correlation of which is 
indicated by arrows 118, 120, 122 and 124). The transfer of 
the second memory operation is complete upon return of 60 
data DA23 in the present example. Subsequently, the data 
transfer for the first memory operation is resumed by pre- 
senting the column address CA12 upon the next assertion of 
the CAS_signal (the correlation of which is indicated by 
arrow 96) Data corresponding to the column address CA12 65 
is accordingly returned (data DA12, the correlation to 
address CA12 being indicated by arrow 104). Column 


address CA13 is presented (not shown) to complete the data 
transfer for the first memory operation. 

In accordance with the above disclosure, a computer 
system has been shown in which a priority level is assigned 
to each memory operation by the initiator of the memory 
operation. A low priority level is assigned to prefetch 
memory operations, while fetch memory operations receive 
a high priority level. If a prefetch memory operation is 
in-progress and a fetch memory operation to the same page 
as the prefetch memory operation is detected, then the 
prefetch memory operation is interrupted to perform the 
fetch memory operation. The latency of the fetch memory 
operation may not be adversely affected by the existence of 
the prefetch memory operation. Overall performance of the 
computer system may thereby be increased. 

Numerous variations and modifications will become 
apparent to those skilled in the art once the above disclosure 
is fully appreciated. It is intended that the following claims 
be interpreted to embrace all such variations and modifica- 
tions. 

What is claimed is: 

1. A computer system comprising: 

one or more microprocessors, wherein said one or more 
microprocessors are configured to initiate a first 
memory operation and to subsequently initiate a second 
memory operation, and wherein said one or more 
microprocessors are configured to assign a first priority 
to said first memory operation, and wherein said one or 
more microprocessors are configured to assign a second 
priority to said second memory operation; 

a main memory configured to store data including a first 
data corresponding to said first memory operation and 
a second data corresponding to said second memory 
operation; and 

a bus bridge coupled to said one or more microprocessors 
and to said main memory, wherein said one or more 
microprocessors are configured to convey said first 
memory operation and to concurrently convey said first 
priority to said bus bridge, and wherein said one or 
more microprocessors are further configured to convey 
said second memory operation and to concurrently 
convey said second priority to said bus bridge, and 
wherein said bus bridge is configured to initiate a 
transfer of said first data responsive to said first 
memory operation, and wherein said bus bridge is 
configured to interrupt said transfer of said first data in 
response to said second memory operation if said 
second priority is higher than said first priority, and 
wherein said bus bridge is configured to inhibit inter- 
ruption of transferring said first data if said first 
memory operation and said second memory operation 
are in different pages of said main memory. 

2. The computer system as recited in claim 1 wherein said 
first memory operation is a read and said second memory 
operation is a write, and wherein said second priority is 
higher than said first priority, 

3. The computer system as recited in claim 1 wherein said 
one or more microprocessors are configured to assign each 
of said first priority and said second priority to one of at least 
two priority levels. 

4. The computer system as recited in claim 3 wherein said 
at least two priority levels include a prefetch priority level 
and a fetch priority level. 

5. The computer system as recited in claim 4 wherein said 
fetch priority level is higher than said prefetch priority level. 

6. The computer system as recited in claim 3 wherein said 
at least two priority levels include a non-speculative priority 
level and a speculative priority level. 
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7. The computer system as recited in claim 6 wherein said 
non-speculative priority level is higher than said speculative 
priority level. 

8. The computer system as recited in claim 3 wherein said 

at least two priority levels includes a fill priority level and a 5 
write back priority level. 

9. The computer system as recited in claim 8 wherein said 
fill priority level is higher than said write back priority level. 

10. The computer system as recited in claim 1 wherein 
said first memory operation is initiated by a first micropro- Q 
cessor of said one or more microprocessors and wherein said 
second memory operation is initiated by a second micro- 
processor of said one or more microprocessors. 

11. The computer system as recited in claim 1 wherein 
said first memory operation and said second memory opera- 35 
tion are initiated by a same one of said one or more 
microprocessors. 

12. The computer system as recited in claim 1 wherein 
said bus bridge is configured to transfer said second data 
while transfer of said first data is interrupted. 2Q 

13. The computer system as recited in claim 12 wherein 
said bus bridge is configured to resume transferring said first 
data upon completing transfer of said second data. 

14. The computer system as recited in claim 1 wherein 
said bus bridge is configured to interrupt a transfer of said 25 
first data between said main memory and said bus bridge. 

15. The computer system as recited in claim 1 wherein 
said first data comprises a plurality of beats of data, each of 
said plurality of beats of data transferred at a different time, 
and wherein interrupting said transfer is performed between 3Q 
two beats of the plurality of beats. 

16. A method for transferring data in a computer system, 
comprising: 

receiving a first memory operation and concurrently 
receiving a first priority of said first memory operation, 35 
said first priority assigned by a microprocessor initiat- 
ing said first memory operation; 

receiving a second memory operation and concurrently 
receiving a second priority of said second memory 
operation, said second priority assigned by a micropro- 40 
cessor initiating said second memory operation, said 
receiving and said concurrently receiving being subse- 
quent to said receiving said first memory operation; 

transferring data corresponding to said first memory 
operation; and 45 

interrupting said transferring if said second priority is 
higher than said first priority, and transferring data 
corresponding to said second memory operation during 
said interrupting, wherein said interrupting is inhibited 
if a first memory address corresponding to said first 50 
memory operation is in a different page than a second 
memory address corresponding to said second memory 
operation. 

17. The method as recited in claim 16 wherein said first 
memory operation is a read and said second memory opera- 55 
tion is a write, and wherein said second priority is higher 
than said first priority. 

18. The method as recited in claim 16 further comprising 
transferring a remaining portion of said data corresponding 

to said first memory operation subsequent to said interrupt- 60 
ing. 

19. The method as recited in claim 16 wherein said data 
corresponding to said first memory operation comprises a 
plurality of beats of data, each of said plurality of beats of 
data transferred at a different time, and wherein said inter- 65 
rupting is performed between two beats of the plurality of 
beats. 
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20. A bus bridge for a computer system, comprising: 

a CPU interface block coupled to receive memory 
operations, said CPU interface block further coupled to 
receive a corresponding priority for each memory 
operation, said priority assigned by one or more micro- 
processors initiating said memory operation; and 

a memory controller coupled to said CPU interface block 
and a memory, wherein said memory controller is 
configured to receive said each memory operation and 
said corresponding priority from said CPU interface 
block, and wherein said memory controller is config- 
ured to interrupt an in -progress memory operation to 
service a subsequent memory operation if a first prior- 
ity corresponding to said in-progress memory operation 
is lower than a second priority corresponding to said 
subsequent memory operation, and wherein said 
memory controller is configured to inhibit interrupt of 
said in-progress memory operation if said in-progress 
memory operation is to a different page of said memory 
than said subsequent memory operation. 

21. The bus bridge as recited in claim 20 wherein said 
memory controller is configured to resume said in-progress 
memory operation after servicing said subsequent memory 
operation. 

22. The bus bridge as recited in claim 20 wherein said 
in-progress memory operation is a read and said subsequent 
memory operation is a write, and wherein said second 
priority is higher than said first priority. 

23. The bus bridge as recited in claim 20 wherein inter- 
rupting said in-progress memory operation is performed 
between a transfer of a first beat of a plurality of beats of data 
accessed in response to said in-progress memory operation 
and a transfer of a second beat of a plurality of beats of data 
accessed in response to said in-progress memory operation. 

24. A bus bridge for a computer system, comprising: 

a CPU interface block coupled to receive memory opera- 
tions; and 

a memory controller coupled to said CPU interface block 
and a memory, wherein said memory controller is 
configured to receive a first memory operation from 
said CPU interface block, and wherein said memory 
controller is configured to interrupt an in-progress 
memory operation to service said first memory opera- 
tion if said first memory operation is higher priority 
than said in-progress memory operation, and wherein 
the memory controller is configured to inhibit interrupt 
of said in-progress memory operation if said 
in-progress memory operation is to a different page of 
said memory than said first memory operation. 

25. The bus bridge as recited in claim 24 wherein said 
CPU interface block is further coupled to receive a corre- 
sponding priority for each memory operation, said priority 
assigned by one or more microprocessors initiating said 
memory operation. 

26. The bus bridge as recited in claim 24 wherein said 
memory controller is configured to resume said in-progress 
memory operation after servicing said subsequent memory 
operation. 

27. The bus bridge as recited in claim 24 wherein said 
in-progress memory operation is a read and said first 
memory operation is a write. 

28. The bus bridge as recited in claim 24 wherein said 
in-progress memory operation is a prefetch and said first 
memory operation is a fetch. 

29. The bus bridge as recited in claim 24 wherein said 
in-progress memory operation is speculative and said first 
memory operation is non-speculative. 
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30. The bus bridge as recited in claim 24 wherein said 
in-progress memory operation is a writeback and said first 
memory operation is a fill. 

31. The bus bridge as recited in claim 24 wherein said 
memory controller is configured to transfer data correspond- 
ing to said first memory operation while said in-progress 
memory operation is interrupted. 

32. A method for transferring data in a computer system, 
comprising: 

receiving a first memory operation; 

receiving a second memory operation subsequent to said 
receiving said first memory operation; 

transferring data corresponding to said first memory 
operation; and 

interrupting said transferring if said second memory 
operation is higher priority than said first memory 
operation, and transferring data corresponding to said 
second memory operation during said interrupting, and 
wherein said interrupting is inhibited if a first memory 2 o 
address corresponding to said first memory operation is 
in a different page than a second memory address 
corresponding to said second memory operation. 
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33. The method as recited in claim 32 wherein said first 
memory operation is a read and said second memory opera- 
tion is a write. 

34. The method as recited in claim 32 further comprising 
transferring a remaining portion of said data corresponding 
to said first memory operation subsequent to said interrupt- 
ing. 

35. The method as recited in claim 32 further comprising 
assigning a first priority to said first memory operation by 

a microprocessor initiating said first memory operation 
and conveying said first priority; 

receiving the first priority concurrent with said receiving 
said first memory operation; 

assigning a second priority to said second memory opera- 
tion by a microprocessor initiating said second memory 
operation and conveying said second priority; and 

receiving the second priority concurrent with said receiv- 
ing said second memory operation. 
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